Feat/simple mse metric #388 #425

AnikethBhosale · 2025-10-31T18:01:02Z

Description

Implements MSE (Mean Squared Error) metric for Pruna's evaluation framework. The metric computes mean squared error between model predictions and ground truth values, accumulating results across batches using StatefulMetric pattern.

Related Issue

Fixes #388

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update

How Has This Been Tested?

Created comprehensive test suite with 15 tests covering:
- Perfect match scenarios (MSE = 0)
- Known value calculations
- Multiple batch accumulation
- Empty state handling
- Multi-dimensional tensors (1D to 4D)
- Device compatibility (CPU/CUDA)
- Edge cases (None inputs, shape mismatches)
All tests pass: 14 passed, 1 skipped (CUDA test on non-CUDA systems)
Test coverage: 89% for metric_mse.py
Verified style compliance: ty check and ruff check pass

Checklist

My code follows the style guidelines of this project
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes

Additional Notes

Metric is registered with MetricRegistry and can be used via Task(metrics=["mse"])
Follows existing patterns from other stateful metrics (e.g., SharpnessMetric)
Uses list-based state accumulation as required by StatefulMetric framework
Documentation includes usage examples, technical details, and related metrics

…er Understanding"

…sts and documentation

begumcig

Wow, this is already almost flawless, asked for some small changes but it is almost ready to be merged. Thanks a lot @AnikethBhosale

begumcig · 2025-11-07T16:41:18Z

src/pruna/evaluation/metrics/metric_mse.py

+            return
+
+        # Ensure tensors are on the same device
+        output_tensor = output_tensor.to(gt_tensor.device)


this is a great idea, that's why we have integrated device casting in the metric_data_processor. how do you feel about passing the device to it instead?

begumcig · 2025-11-07T16:41:38Z

src/pruna/evaluation/metrics/metric_mse.py

+            The model predictions/outputs.
+        """
+        # Process inputs based on call_type (returns tuple of tensors)
+        inputs = metric_data_processor(x, gt, outputs, self.call_type)


you can pass the device here (regarding the comment below)

begumcig · 2025-11-07T16:44:41Z

tests/evaluation/test_mse.py

@@ -0,0 +1,247 @@
+# Copyright 2025 - Pruna AI GmbH. All rights reserved.


I really like the variety in the tests! How do you feel about testing with some data from the pruna similar to what we have in tests/evaluation/test_torch_metrics.py?

begumcig · 2025-11-07T16:46:04Z

MSE_IMPLEMENTATION_SUMMARY.md

@@ -0,0 +1,200 @@
+# MSE Metric Implementation Summary


Thank you a lot for this detailed summary, are we planning on merging it to Pruna or is it more for giving information? I think this would be even more beneficial as the PR description

AnikethBhosale added 3 commits October 10, 2025 14:28

Added image in contributing.md for "Pruna AI's Working Logic For Easi…

2072d10

…er Understanding"

feat: implement Mean Squared Error metric for evaluation

ca411a0

feat: implement Mean Squared Error (MSE) metric with comprehensive te…

b44939c

…sts and documentation

begumcig self-requested a review November 7, 2025 12:43

begumcig requested changes Nov 7, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feat/simple mse metric #388 #425

Feat/simple mse metric #388 #425

Uh oh!

AnikethBhosale commented Oct 31, 2025

Uh oh!

begumcig left a comment

Uh oh!

begumcig Nov 7, 2025

Uh oh!

begumcig Nov 7, 2025

Uh oh!

begumcig Nov 7, 2025

Uh oh!

begumcig Nov 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		@@ -0,0 +1,247 @@
		# Copyright 2025 - Pruna AI GmbH. All rights reserved.

Feat/simple mse metric #388 #425

Are you sure you want to change the base?

Feat/simple mse metric #388 #425

Uh oh!

Conversation

AnikethBhosale commented Oct 31, 2025

Description

Related Issue

Type of Change

How Has This Been Tested?

Checklist

Additional Notes

Uh oh!

begumcig left a comment

Choose a reason for hiding this comment

Uh oh!

begumcig Nov 7, 2025

Choose a reason for hiding this comment

Uh oh!

begumcig Nov 7, 2025

Choose a reason for hiding this comment

Uh oh!

begumcig Nov 7, 2025

Choose a reason for hiding this comment

Uh oh!

begumcig Nov 7, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants